Generic recognition for computer vision is a goal that is still far from reality. Part of the problem rests in the inherent limitations of current "model-based" vision. Our approach moves away from specific geometric or structural models arid instead focuses on the functionality of the object as the property which drives the recognition process. This results in a representation that is generic in the sense of capturing an entire category of objects. One important assumption underlying the form and function approach is that a "small" number of "primitive" concepts about shape, physics and causation will suffice to define the functionality of a broad range of categories. If multiple new "primitives" were required to define each additional cat...